Link Prediction in the Yelp Social and Review Networks

نویسنده

  • Lucas Finn
چکیده

In this project, we apply two predictive network algorithms to the Yelp academic challenge dataset. The first algorithm is adapted from Leskovec, Huttenlocher and Kleinberg (2010) to predict the rating that a user will assign to a business [1]. We adapt the notion of signed edges in the network to account for both useruser edges (friendships) as well as user-business edges (reviews). A logistic regression model is trained from features in the social and review network data, and experiments are carried out using ten-fold cross validation. We find that the ability to predict a user’s rating of a business depends highly on the prior ratings given by the user and the prior ratings received by the business. While a user’s rating is impacted by the ratings given by his or her friends, this influence appears to be a second-order effect. In addition, the logistic regression classifier achieves a moderate improvement over a baseline classifier. The second algorithm is adapted from Backstrom and Leskovec (2011) to predict the friends that a user will link to in the future [2]. We extract eleven features from the Yelp dataset and separate friendships into training and testing sets. The training edges remain in the social network; the testing edges are removed. We then optimize a personalized weight vector for each user with gradient descent using a supervised random walk. We find that 56% of the testing edges are ranked in the top 20 recommendations for 300 randomly selected users. Because the candidate set can contain thousands of possible edges, this result is moderately significant. The applications of these predictive models in the Yelp data are twofold: the first algorithm enables a user to predict their future evaluation of a business; equivalently a business can search for potential new clients. The second algorithm enables a friendship recommendation engine, which improves the user experience of the Yelp service, which benefits both users and businesses.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Link Prediction in Bipartite Networks - Predicting Yelp Reviews

In this paper, we aim to predict new user reviews on businesses in the Yelp social network. We formulate this as a network link prediction problem by modeling Yelp dataset as a bipartite network between users and businesses. We implement link prediction algorithms with various proximity metrics, thoroughly evaluate the effectiveness of each algorithm and conclude that Delta, AdamicAdar and Comm...

متن کامل

A Link Prediction Method Based on Learning Automata in Social Networks

Nowadays, online social networks are considered as one of the most important emerging phenomena of human societies. In these networks, prediction of link by relying on the knowledge existing of the interaction between network actors provides an estimation of the probability of creation of a new relationship in future. A wide range of applications can be found for link prediction such as electro...

متن کامل

Group 14 Predicting Yelp Reviews using Internal Links - Link Prediction in Bipartite Networks

Social Networks may exist as bipartite graphs which evolve with new links being continuously added between the two disjoint set of nodes. To understand the dynamism of such bipartite networks, it is useful to predict links between existing nodes. Using the Yelp social network between users and businesses, we attempt to predict user reviews on businesses and the rating of predicted reviews. In o...

متن کامل

Providing a Link Prediction Model based on Structural and Homophily Similarity in Social Networks

In recent years, with the growing number of online social networks, these networks have become one of the best markets for advertising and commerce, so studying these networks is very important. Most online social networks are growing and changing with new communications (new edges). Forecasting new edges in online social networks can give us a better understanding of the growth of these networ...

متن کامل

پیشگویی پیوند در شبکه های اجتماعی با استفاده از ترکیب دسته بندی کننده ها

Abstract Link prediction in social networks is one of the most important activities in analysis of such networks. The importance of link prediction in social networks is due to its dynamic nature. While members and their relationships (links) in such networks are continuously increasing, links may be missed due to various reasons. By predicting such links, the possibility of extension, compl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014